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THE  CEMS  IV  OAP  ALGORITHM 


by 

H.  J.  Larson 
T.  Jayachandran 


The  Air  Force  Oil  Analysis  Program  (OAP)  uses  spectrometric  analyses 
of  used  oil  to  monitor  the  levels  of  metallic  contaminants  in  the  lubricat¬ 
ing  systems  of  aircraft  engines  and  other  types  of  equipment.  Oil  sarrples 
are  analyzed  at  more  or  less  regular  intervals  (every  10  operating  hours, 
after  every  flight),  allowing  observation  of  the  temporal  evolution  of  the 
contaminants  being  monitored.  These  temporal  observations  are  used,  in 
turn,  to  recommend  special  maintenance  actions  when  required,  generally 
triggered  by  "high"  levels  of  one  or  more  contaminants. 

During  fiscal  year  1982,  the  Air  Force  contracted  for  the  development 
of  the  Comprehensive  Engine  Management  System  (CEMS),  phase  IV.  This  sys¬ 
tem  is  to  operate  at  two  different  levels: 

(a)  it  is  to  gather  maintenance  and  other  data  at  the  level  of  the 
individual  base,  making  this  data  available  at  the  base  level  to 
aid  in  maintenance  decisions. 

(b)  a  central  data  bank  is  to  be  located  at  Tinker  Air  Force  Base, 
bringing  together  data  from  all  the  individual  bases. 

The  OAP  data  generated  by  an  Air  Force  base  is  to  be  one  of  the  data 
elements  in  the  base  level  CEMS  IV  system.  Because  of  this,  upon  the  adop¬ 
tion  of  the  CEMS  IV  system  it  is  planned  that  every  Air  Force  oil  analysis 
laboratory  will  be  equipped  with  a  computer  terminal,  linking  it  to  the 
base's  CEMS  IV  computer,  allowing  real  time  processing  of  the  oil  analysis 
data. 

The  use  of  regression  methodology  to  aid  the  OAP  decision  making  proc¬ 
ess  has  been  suggested  several  times  (see  references  [1]  through  [6]).  The 
main  drawback  to  implementing  this  type  of  approach  has  been  the  data- 
analytic  requirements  of  such  a  system.  Many  of  the  OAP  laboratories  main¬ 
tain  a  heavy  workload  of  sample  analyses  and  are  not  equipped  to  handle 
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extensive  number  crunching  prior  to  giving  a  laboratory  recommendation,  based 
on  the  numbers  produced  by  the  spectrometric  analysis  of  a  used  oil  sample. 
Availability  of  real  time  data  processing,  and  a  direct  link  between  the 
spectrometer  and  the  CEMS  IV  computer,  will  remove  this  drawback. 

For  any  aircraft  participating  in  OAP,  used  engine  oil  samples  are  re¬ 
moved  on  a  regular  basis.  The  frequency  of  such  samples  depends  on  the 
aircraft  type  involved.  Some  aircraft  are  sampled  after  every  flight,  some 
every  flying  day,  some  every  10  flight  hours,  etc.  In  every  case  the  used 
oil  sample  is  delivered  to  an  oil  analysis  laboratory  where  it  is  analyzed 
on  a  spectrometer.  The  spectrometer  produces  a  digital  readout  of  the  part 
per  million  content  of  any  of  20  different  contaminants;  generally  the  lab¬ 
oratory  only  analyzes  for  those  metals  that  are  in  contact  with  the  lubri¬ 
cating  oil  in  the  engine  sampled.  In  current  practice,  the  reading  for 
each  metal  of  interest  is  subjected  to  two  comparisons  defined  by  the  Tech¬ 
nical  Order  (T.O.)  table  for  the  specific  engine  sampled.  The  T.O.  table 
gives  a  trend  value  and  a  number  of  range  values,  for  each  contaminant  of 
interest.  The  range  values  provide  absolute  limits  for  the  amount  of  the 
contaminant,  regardless  of  how  long  it  has  been  since  the  last  sample  was 
taken  or  since  the  oil  was  changed.  Thus  for  iron,  for  example,  the  table 
may  say  the  normal  range  is  0-12  ppm,  the  marginal  range  is  13  to  15  ppm, 
the  high  range  is  16  to  18  ppm  and  the  abnormal  range  is  19  ppm  and  over. 

The  spectrometer  operator  then  observes  where  the  current  iron  reading 
falls,  relative  to  these  values,  and  notes  the  appropriate  recommendation. 

The  trend  value  is  used  to  judge  the  change  in  contaminant  level  from 
the  preceding  sample  to  the  current  reading.  For  iron,  say  the  T.O.  trend 
value  is  4  ppm.  The  spectrometer  operator  must  then  take  the  iron  reading 
for  the  current  sample,  subtract  the  iron  reading  for  the  previous  sample, 
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multiply  by  10  and  then  divide  by  the  change  in  operating  hours.  This 
computation  gives  an  estimate  of  the  rate  of  change  of  the  iron  contamina¬ 
tion,  standardized  to  a  10  hour  operating  period.  If  the  computer  value  is 
no  more  than  4  (for  this  example)  the  sample  has  a  normal  trend;  otherwise 
the  T.O.  recommendation  is  to  declare  the  trend  abnormal  and  appropriate 
recommendations  are  made,  based  on  the  range  and  trend  results. 

The  oil  sample  received  by  the  spectrometri c  laboratory  is  removed 
from  the  aircraft's  oil  sump  while  the  engine  is  still  hot,  hopefully  en¬ 
suring  that  the  fluid  in  the  sump  is  homogeneous  and  that  the  contamination 
in  the  sample  removed  is  representative  of  that  in  the  sump.  The  sample  is 
placed  in  a  small  bottle,  labelled  to  identify  the  aircraft,  date  and  cumu¬ 
lative  hours  since  the  oil  was  changed,  and  then  is  sent  to  the  laboratory. 
At  the  laboratory  the  sample  is  well  mixed  and  then  a  small  part  of  the  oil 
in  the  bottle  is  burned  by  the  spectrometer  to  produce  the  ppm  contaminant 
counts. 

The  counts  produced  by  the  spectrometer  are  in  fact  observed  values  of 
random  variables.  This  apparent  randomness  in  the  readings  is  caused  by  a 
number  of  factors,  including  the  following.  If  the  oil  in  the  same  sample 
bottle  is  analyzed  two  or  more  times,  the  counts  produced  are  not  the  same. 
This  variation  in  readings  is  caused  by  voltage  fluctuations,  temperature 
variations,  actual  length  of  time  the  spectrometer  uses  to  burn  the  oil, 
variations  in  the  actual  contamination  contents  in  the  small  amounts  burned, 
etc.  From  a  broader  view,  if  a  second  sample  were  removed  from  the  aircraft 
sump  it  is  quite  likely  that  the  actual  contamination  levels  in  the  two 
bottles  are  not  in  fact  identical  and,  of  course,  both  may  differ  from  the 
actual  contamination  levels  of  the  sump  itself,  the  quantity  of  interest. 
Earlier  studies  ([3],  [4])  have  shown  that  the  readings  produced  by  the 
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spectrometer  appear  to  be  well  described  by  a  normal  (Gaussian)  distribution. 
Additionally,  it  has  been  shown  ([2],  [3],  [4])  that  different  serial  num¬ 
bers  of  the  same  type  of  equipment  appear  to  present  individual  signatures 
for  contaminant  buildup  in  the  oil  sump,  even  though  they  are  presumably 
"identical"  in  construction. 

Previously  the  only  useful  environmental  variables  readily  available 
to  the  technician  in  making  oil  analysis  recommendations  were  the  number  of 
flight  hours  since  the  oil  was  changed  and  the  number  of  hours  since  over¬ 
haul.  With  the  advent  of  CEMS  IV  a  new  environmental  variable  is  now 
available  from  other  maintenance  sources:  whether  or  not  fresh  oil  was 
added  to  the  sump.  Our  major  task  for  this  year  was  to  provide  an  imple- 
mentable  statistical  algorithm  to  aid  the  operator  in  making  recommenda¬ 
tions.  This  algorithm  was  to  take  into  account  (minimize  the  effects  of) 
the  random  noise  of  the  spectrometer,  the  number  of  flight  hours  since  oil 
change  and  the  oil  addition  records. 

For  the  A-10  aircraft,  maintenance  procedures  call  for  oil  to  be  added 
(if  needed)  after  the  oil  sample  has  been  removed  for  the  oil  analysis  pro¬ 
gram.  The  maintenance  form  on  which  additions  are  recorded  allows  entry  of 
the  number  of  whole  units  (pints,  quarts  or  gallons)  added  to  the  sump;  for 
the  A-10  aircraft  the  unit  used  is  pints.  No  provision  is  made  for  the 
entry  of  fractional  amounts  of  units  being  added  to  the  sump.  Thus  the  oil 
added  values  consist  of  0’s  and  l’s,  indicating  whether  or  not  a  one  pint 
can  of  oil  was  opened  and  used  to  top  off  the  A-10  sump.  In  actual  prac¬ 
tice,  of  course,  the  amount  added  is  generally  a  fraction  of  a  pint,  but 
this  is  not  reflected  in  the  records  available  to  CEMS  IV. 

Among  other  topics,  reference  [6]  discusses  the  use  of  oil  addition 
records  to  estimate  wear  metal  production  rates  for  aircraft  engines.  This 
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approach  is  based  on  several  assumptions  which  may  or  may  not  be  universally 
acceptable  for  all  aircraft  types.  A  tacit  assumption  apparently  made  is 
that  oil  is  lost  through  a  leaking  or  burning  phenomenon  and  that  the 
metallic  contaminants  are  also  lost  in  direct  proportion  to  the  oil  lost. 
That  is,  if  the  iron  contamination  level  (as  measured  by  the  spectrometer) 
is,  say,  10  ppm  and  one  pint  of  oil  has  been  added  to  an  aircraft  sump  which 
holds  10  pints,  reference  [6]  suggests  that  the  iron  contamination  level 
shoud  be  "corrected"  to  read  11  ppm  (a  10%  upward  adjustment  to  account  for 
the  10%  addition  of  fresh  oil).  The  procedure  suggested  in  [6]  also  tacitly 
assumes  a  relatively  accurate  record  made  of  the  amount  of  oil  added  on  each 
occurrence,  rather  than  a  simple  0-1  variable  indicating  whether  a  one  pint 
can  was  opened  and  partially  poured  into  the  sump.  This  suggested  procedure 
then  goes  on  to  suggest  fitting  least  squares  regression  curves  with  the 
corrected  concentration  as  the  dependent  variable  and  number  of  flight  hours 
since  oil  change  as  the  independent  variable.  The  least  squares  regression 
approach  certainly  seems  justified,  to  minimize  the  effects  of  the  spectrom¬ 
eter  errors  of  measurement  of  the  ppm  concentration.  The  use  of  the 
"corrected"  concentration,  though,  does  not  seem  wise  for  the  CEMS  IV  algo¬ 
rithm,  in  part  because  of  the  crude  indication  of  how  much  oil  was  added 
(and  thus  how  much  the  spectrometer  reading  should  be  corrected).  Of  equal 
importance,  it  may  not  be  true  that  the  iron  contamination  is  lost  at  the 
same  rate  as  the  oil  itself.  If  an  evaporative  mechanism  were  causing  the 
oil  loss,  it  seems  possible  the  iron  contamination  may  not  evaporate  at  the 
same  rate  as  the  oil,  if  at  all;  if  this  were  true  the  "corrected"  concen¬ 
trations  would  then  be  too  high. 

The  algorithm  employed  in  the  CEMS  IV  prototype  uses  least  squares  re¬ 
gression  methodology  to  minimize  the  effects  of  the  random  spectrometer 


errors.  It  utilizes  the  number  of  hours  since  oil  change  as  an  independent 
variable  and,  if  oil  addition  records  are  available,  it  lets  least  squares 
itself  determine  the  "corrections"  to  be  applied  to  the  spectrometer  con¬ 
tamination  readings.  This  algorithm  will  now  be  described. 

The  contaminants  monitored  for  the  TF34  engine  in  the  A-10  aircraft  are 
Fe,  Ag,  Al,  Cr,  Cu,  Ni,  Ti  .  These  7  different  contaminants  are  treated 
separately  and  in  the  same  way.  The  following  discussion  refers  to  only  one 
contaminant;  it  is  understood  that  the  same  procedures  are  applied  to  each 
and  that  the  data  from  each  different  serial  number  are  treated  separately. 
Define 

=  Spectrometer  contaminant  reading  for  the  i—  sample. 

T..  =  Number  of  hours  since  oil  change  when  i—  sample  is  taken. 
ai  =  Amount  of  oil  added  to  the  sump,  after  the  i—  sample  is  sent  to 
the  spectrometer. 


X.  =  l  a.  iY./H  ,  the  accumulated  "correction"  to  the  spectrometer 
i  J=2  J-i  J 

reading  in  the  spirit  of  reference  [6];  the  TF34  sump  is  assumed 
to  hold  11  pints,  the  reason  for  the  divisor  of  11. 
e^  =  random  spectrometer  measurement  error  on  the  i—  sample. 

Formally,  the  computations  in  the  algorithm  then  are  consistent  with  the 
assumption  that 

(1)  Yi  ■  80  *  elT(  *  *2*1  *  ei  • 

where  the  e^'s  are  independent  normal  random  variables  with  mean  0  and 
variance  nZ  .  Standard  formulas  for  unweighted  least  squares,  with  two 
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independent  variables  (Ti  and  X. )  are  employed  to  estimate  the  unknown  param- 

2 

eters  8q,  Bp  and  a  .  Detailed  definition  of  these  formulas  is 
provided  in  Appendix  I. 

The  synthetic  variable  X.  in  equation  (1)  is  created  from  the  oil 
added  values  (a^ )  and  the  spectrometer  contaminant  readings;  indeed  Xi  is 
the  "correction"  to  be  applied  to  the  i—  contaminant  reading  by  one  of  the 
procedures  described  in  [6],  granted  the  full  pint  was  added  to  the  sump. 

If  we  were  to  assume  that  B£  =  -1  then  equation  (1)  is  equivalent  to  the 
wear  metal  production  rate  estimation  procedure  given  in  [6],  Use  of  X. 
in  this  way  gives  (1)  an  autoregressi ve  flavor.  It  is  interestiny  to  note 
that  the  use  of  regular  unweighted  least  squares  on  (1)  does  in  fact  yield 
true  least  squares  estimates  of  Bq,  8^  and  »  as  proved  in  Appendix  II. 
With  the  added  assumption  of  normality  of  the  e^  's  ,  the  estimates  used  are 
also  maximum  likelihood. 

The  CEMS  IV  algorithm  applies  two  statistical  tests  to  the  spectrometer 
reading,  in  addition  to  the  T.O.  limits  mentioned  earlier.  These  two  tests 
are  meant  to  be  similar  in  spirit  to  the  T.O.  range  and  T.O.  trend  compari¬ 
sons;  they  differ  from  these  T.O.  comparisons  in  that  they  are  determined  by 
the  historical  data  base  for  the  serial  number  being  analyzed.  These  two 
statistical  tests  are  called  the  Primary  test  (similar  to  the  T.O.  range 
comparison)  and  the  Secondary  test  (similar  to  the  T.O.  trend  comparison). 
After  the  spectrometer  analysis  has  been  made  for  an  incoming  sample 

(say  from  serial  number  1111),  the  computer  calls  up  the  prior  data  base  for 

2 

serial  number  1111,  and  uses  this  data  to  estimate  8^,  Bp  62  and  a  from 
equation  (1).  This  estimated  equation  then  is  used  to  extrapolate  forward 
to  the  hours  since  oil  change  value  (T. )  and  the  value  for  the  new  sample 
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just  analyzed.  This  extrapolation  produces  three  numbers,  labelled  N,  M,  H 
in  Figure  1  (plotted  for  a  case  in  which  no  oil  has  been  added);  these  num¬ 
bers  are  90%,  95%  and  99%  prediction  limits  for  what  contaminant  reading  one 
would  expect  at  this  time,  based  on  the  data  base.  The  current  contaminant 
reading  (just  produced  by  the  spectrometer)  is  labelled  C.  Table  1  gives  the 
result  of  the  primary  test,  based  on  the  relationship  of  C  to  N,  M,  H. 

Table  1 


Current  value 

Primary  test  result 

C  <  N 

Normal 

N  <  C  <  M 

Margi nal 

M  <  C  <_  H 

High 

H  <  C 

Abnormal 

As  pictured  in  Figure  1  the  primary  test  result  would  be  normal. 

The  Secondary  test  adds  the  current  reading  C  to  the  data  base  and 

then  splits  the  data  base  into  two  parts:  the  earliest  75%  of  the  data 

records  versus  the  latest  25%  of  the  data  records.  Equation  (1)  is  then  fit 

separately  to  each  of  these  two  pieces,  giving  two  estimates  of  6^,  the 

rate  of  change  of  the  contaminant.  These  two  estimates  of  are  then 

compared  (using  a  Student's  t  statistic)  to  see  if  it  appears  likely  that 

the  rate  of  change  in  the  later  data  is  larger  than  that  in  the  earlier  part. 

This  test  is  pictured  in  Figure  2,  again  for  a  case  with  no  oil  additions. 

If  the  computed  t  statistic  exceeds  the  95^  quantile  of  the  appropriate 
t  distribution,  the  secondary  test  produces  an  abnormal  recommendation; 

otherwise  the  recommendation  returned  is  normal.  For  the  two  slopes  pictured 

in  Figure  2,  the  Secondary  test  result  is  normal. 


9 


At  this  point  the  computer  has  evaluated  four  recommendations  for  each 
element:  the  T.O.  range  and  trend  values,  as  well  as  the  Primary  and  Second¬ 
ary  statistical  test  results.  It  then  takes  the  worst  case  of  the  T.O.  range 
and  Primary  tests,  and  the  worst  case  of  the  T.O.  trend  and  Secondary  tests 
and  uses  these  as  entries  in  the  T.O.  decision  making  guidance  table.  The 
result  of  this  is  the  computer's  recommendation  for  each  element  for  the  cur¬ 
rent  sample  (standard  A,  B,  C,  E,  F,  H,  J,  P,  S,  T,  or  U  as  used  in  the 
JOAP  laboratory  manual).  This  computer  recommendation  may  be  accepted  (used) 
by  the  OAP  technician  or  may  be  overridden  and  changed  by  him  if  he  feels  that 
to  be  appropriate. 

This  discussion  of  the  Primary  and  Secondary  tests  has  referred  to  a 
historical  data  base  for  each  serial  number.  For  many  aircraft  types,  oil 
changes  are  widely  separated  in  time  and  for  some  types,  the  oil  is  never 
changed.  Thus,  if  all  historical  data  were  maintained,  the  data  base  could 
become  quite  large  for  each  serial  number,  requiring  a  very  large,  accessible 
data  storage  facility  for  each  engine  at  each  base.  Of  equal  importance,  it 
seems  intuitively  reasonable  that  older  data  gets  "stale",  that  ancient  his¬ 
tory  has  little  bearing  on  the  judgment  of  the  current  state  of  an  engine's 
health.  Because  of  this  the  statistical  data  base  for  each  engine  consists 
only  of  the  20  latest  historical  records  which  were  accepted  as  being  normal 
for  the  given  engine.  The  current  spectrometer  reading  makes  the  21- 
record,  the  largest  number  used  in  the  statistical  algorithm;  as  each  new 
record  is  accepted  as  being  normal,  it  replaces  the  oldest  record  in  the  data 
base. 

When  a  new  engine  enters  the  program,  or  an  old  engine  has  an  oil 
change,  for  the  first  7  records  only  the  T.O.  range  and  trend  computations 
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are  used  for  the  computer  recommendation.  When  the  8^-  record  becomes  avail¬ 
able,  the  Primary  test  is  also  applied  (but  the  Secondary  is  not);  this  is 
also  the  case  for  records  9,  10,  11  and  12.  The  13—  and  all  subsequent 
records  are  subjected  to  both  the  Primary  and  Secondary  tests.  The  statisti¬ 
cal  algorithm  has  been  programmed  in  APL  at  the  Naval  Postgraduate  School  and 
in  Fortran  by  the  CEMS  IV  contractor.  Several  data  sets  have  been  used, 
giving  identical  results  from  both  programs. 
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Formulas  for  CEMS  IV  OAP 
H.  Larson,  T.  Jayachandran 
I  Primary  Test  Model  1 

Element  values  Y1 ,  Y^,  ...»  YN _1 

Time  values  T^ ,  T2>  ....  TN_-| 

Computed  "oil  addition"  values  X-j ,  X2,  ....  X^  -j 


Compute  sums  of  squares  and  cross  products 


<2V2 

N-l 


(£Tj)2 

N-l 


d*()2 

N-l 


S  P 


YT 


£Yi 


Tj 


(ZYi)(lT1) 

N-l 


SP 


YX 


’  £Vi 


N-l 


SPTX  = 


N-l 


p 

Compute  denominator  D  =  (SSj) (SSX)  -  (SPyX) 
Compute  coefficients 


6,  -  C(SSX)(SPYT)  -  (SPTX)(SPYX)]  t  D 

b2  *  [(sst)(spyx)  -  (sptx)(spyt)]  v  D 


and  means 


TY. 

?  =  —L 
T  N-l 


T  = 


lh 

N-l 


i.h. 

A  N-l 
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Compute  residual 


RES  =  SSy  -  ^(SP^)  -  B2(SPyx) 

Current  values  for  time,  "oil  addition"  -  T^, 
Current  value  for  element  - 


Compute  increment 
INC  = 


JPT  +  C(Tn-T)2SSx  +  (XN-5i)2SST  -  2(Tn-T)(Xn-X)SPtx]  v  D) 


1/2 


Compute  predicted  element  value 

P  =  Y+B^Tjj-T)  +  62(Xn-X) 

Compute  limits 

L1  =  P  +  t  9(N-4)INC 
L2  -  P  +  t  g5(N-4)INC 
L3  =  p  +  t  gg(N-4)INC 

where  t  g(N-4),  t  g5(N-4),  t  gg(N-4)  are  quantiles  of  the  t-distribution, 

N-4  degrees  of  freedom 

Sample  reading  is  normal  if  <  L1 

Sample  reading  is  marginal  if  <  YN  <  L2 

Sample  reading  is  high  if  L?  <  ^  1  l-3 
Sample  reading  is  abnormal  if  <  YN 
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Compute  limits 

L1  =  P  +  t  g(N-3) INC 

l2  =  p  +  t  g5(N-3)INC 

L3  =  P  +  t  gg(N-3) INC 

where  t  g(N-3),  t  95(N-3)  t  gg(N-3)  are  quantiles  of  the  t-distribution, 
N-3  degrees  of  freedom. 

Sample  reading  is  normal  if  Y^  <  L-j 

Sample  reading  is  marginal  if  L-|  <  Y^  £  L2 

Sample  reading  is  high  if  L2  <  YN  -  L3 

Sample  reading  is  abnormal  if  I-3  <  Y^ 

III  Secondary  test  model  1 
Element  values  Y-j,  Y2>  ...»  Y^ 

Time  values  T-j,  T2,  .  TN 

Computed  "oil  addition"  values  X-j,  X2>  ...»  X^ 

Data  is  split  into  2  segments 

Segment  1  Earliest  N1  =  [.75N]  values 
Segment  2  Remaining  N2  *  N  -  N-j  values 
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For  each  segment  separately,  j  =  1,2 
Compute  sums  of  squares  and  cross  products 


SS 

^Yj  ZYi  N. 


r  2  dV‘ 

SSTj  -  '  -ft 


2  CZX. )' 
ssxj  -  -  -NT 


r  (IV  (IT  1-) 

SPYTj  =  ^YiTi  "  N. 


p  (ZY-JCZX.) 

SPYXj  =  ^YiXi  ‘  N. 

J 

SP  .MV 

^  iXj  i'iAi  N, 


Compute  denominators  Dj  =  (SS^JCSS^j)  -  (SPj^j) 


Compute  coefficients 


V  =  ^(SSXj  ^  (SPYTj^  '  (SPTXj)(SPYXj^  :  Dj 


02j  -  [(SSTj)(SPYXj.)  -  (SPTXj.)(SPyTj)]  :  Dj 


IYi  ITi  Ix< 
and  means  ?  j  =  “NT  *  Tj  =  “NJ  *  *j  =  1i7 


Compute  residuals 


Compute  test  value 


TEST  »  (e12-6n)  t 


Sample  trend  is  normal  if  TEST  £  t  gg(N-6) 

Sample  trend  is  abnormal  if  t  gg (N-6 )  <  TEST 

where  t  g9(N-6)  is  a  t-distribution  quantile,  N-6  degrees  of  freedom. 


IV  Secondary  test  Model  2 

Element  values  ,  Y2,  ....  Y^ 

Time  values  T1 ,  T2>  ....  T^ 


Data  is  split  into  2  segments 

Segment  1  Earliest  N1  =  [.75N]  values 
Segment  2  Remaining  N2  =  N-N^  values 
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*  a  sp^j 

Compute  coefficients  8j  -  ss^ 


j  -  1.2 


,  -  ^r1  .  T,  »  .  J  =  ^ 


and  means  Y.  =  -n-  »  1  -i  n. 

J  wj  J  j 

Compute  residuals 

RES.  -  SSYj  -  ijSPYTj 


j  *  1.2 


Compute  test  value 


TEST  =  C&i\) 


RES1  +  R £S2 


TM 


*  ssT2) 


1/2 


Sample  trend  is  normal  if  TEST  <  t  99(N-4) 

Sample  trend  is  abnormal  if  *.99^'^  <  TEST 


V.  Residual  SS  =  0?  test 
Residual  SS  is  RES  in  II 


VI.  Residual  SS  *  0?  test 
Residual  SS  is  RES  in  I 
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VII.  Res  i  dual  -j  =  Residua^  =  0?  test 
Values  are  RES^  and  RES^  in  IV 


VIII.  Residual-j  =  Residua^  =  0?  test 
Values  are  RES-j  and  RES^  in  III 


IK.  T- ,X .  correlation  -  1?  test 

With  definitions  in  I  the  correlation  is 


r 


/5$j  SST 


compare  with  1. 


When  the  data  <s  split  into  2  segments,  compute  the  correlation 
separately  for  each 
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X  Oil  addition  computation 


The  values  for  ,  Xg,  ....  XN  are  computed  using  only  the  data  in  the 
current  window  and  do  not  depend  on  earlier  values. 

Formula  used:  X-|  =  0 

Vi  *  xi  *  a(Vi/D 

where  Y.  is  the  ith  element  reading,  a^  is  the  ith  “oil"  value 
(generally  0  or  1),  0  is  the  sump  capacity  (unrelated  to  D  used  in  I). 

Example 
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Appendix  II 

For  any  fixed  number,  n  ,  of  data  points  the  CEMS  IV  algorithm  is 
consistent  with 

(1)  Y.  =  bq  +  6lTi  +  p2Xi  +  ei  *  1  s  1»2 . n 

where  Y,.  =  spectrometer  contaminant  reading,  sample  i 

T,.  =  time  since  oil  change,  sample  i 


i 

X.  =  y  a.  , Y . / 1 1  ,  where  a.  =  amount  of  oil  added,  sample's 
i  j  _2  J "  J  J 

e^  =  random  spectrometer  additive  noise,  sample  i  . 

In  standard  least  squares  notation,  define  Y  to  be  the  nxl  vector  of 
contaminant  readings,  define  X  to  be  the  nx3  matrix  whose  first  column  is 
all  l's  ,  whose  second  column  contains  the  n  -  T\  values  and  whose  third 
column  contains  the  synthetic  X..  values,  and  define  6  to  be  the  3x1 
vector  with  components  BQ,  Bp  Bg  •  Then  is  we1^  known  that  the  simple 
unweighted  least  squares  estimate  for  B  is  given  by 

B  =  (X'X)'1  X'Y 

where  X'  denotes  the  transpose  of  X  . 

To  see  the  form  of  the  least  squares  estimates  of  80,  &2  and  b2  Tor 
equation  (1),  let  A  represent  the  lower  triangular  n  x  n  matrix  whose 
first  row  is  all  0's,  whose  1—  diagonal  and  all  elements  below  it  in  that 
column  are  a^  if  a^  =  0  ,  and  whose  remaining  elements  are  all  0  .  A 
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little  reflection  shows  that  the  elements  of  the  n  x  1  vector  V  =  AY  then 
are  the  values  for  Xp  X2,...,  Xn,  the  synthetic  oil  added  variable.  Addi¬ 
tionally,  let  U  represent  the  n  x  2  matrix  consisting  of  the  first  2 
columns  of  the  matrix  X  ,  let  y  be  the  2x1  vector  with  components 
60  and  B1  (the  first  two  components  of  B)  and  let  e  be  the  n  x  1  vector 
with  components  ep  e2»...  en  .  Then  (1)  can  be  written 


Y  =  Uy  +  $2^  +  e 

=  Uy  +  B2AY  +  e  » 

rearranging  terms,  this  also  can  be  written 

Y  -  62AY  =  Uy  +  e 
or 

(I  -  B2A)Y  =  Uy  +  e  . 

The  unweighted  least  squares  estimates  are  by  definition  the  parameter  values 
which  minimize 

Q  =  e'e  =  ((I  -  62^  ‘  Uy)'((I  -  B2A) Y  -  Uy) 

=  Y 1 ( I  -  fi2A)'(I  -  B2A)Y  -  2y'U'(I  -  62A)Y 
+  y 'U 'Uy  . 


30 

Letting  ^  represent  the  vector  whose  components  are 
respectively,  it  is  easy  to  see  that 


and 
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from  which  it  is  evident  that 


Y  =  (U,U)-1U‘ ( I  -  e2A)Y  . 

Remembering  that  B2  is  a  scalar,  a  little  algebra  shows  that 

||  »  -  Y * (A '  +  A)Y  +  2  e2Y‘A'AY  +  2  y'U'AY  ; 

setting  this  equal  to  zero,  and  substituting  the  above  solution  for  y  then 
easily  gives 


Y' (i(A'+A))Y  -  Y'U(U'U)"1U'AY 

g  —  £ 

2  Y 1 A  * (I  -  U(U,U)“1U' )AY 

as  the  least  squares  estimate  for  32  . 

That  these  solutions  for  6g,  Bj  (y)  and  &2  are  in  fact  the  same  as 
the  earlier  quoted  simple  unweighted  least  squares  values  is  easily  seen  by 
realizing  that 

e  =  Y  -  Uy  -  e2V 
• v  - 
=  y  -  xe 

since  the  X  matrix  is  (U | AY )  and  the  6  vector  is  •  The  well  known 

solution  B  =  ( X ' X ) ~ 1X ' Y  then  must  consist  of  the  subvectors  y  and  B2 

defined  above.  If  we  also  assume  that  e  is  multivariate  normal  with  mean 

2 

vector  0  and  covariance  matrix  a  I  ,  it  is  immediately  apparent  that  y 


and  e2  are  also  the  maximum  likelihood  estimates  and  the  maximum  likelihood 

2 

estimate  for  a  is 


-2  _  Y'Y  -  6‘X'Y 
o  -  - - - 


The  CEMS  IV  algorithm  actually  uses 


s 


2 


no 


2 


n  -  3 


as  the  estimate  for 


2 

cr 
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